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Scalable shared-memory multiprocessors distribute memory among the processors and use 
scalable interconnection networks to provide high bandwidth and low latency 
communication. In addition, memory accesses are cached, buffered, and pipelined to bridge 
the gap between the slow shared memory and the fast processors. Unless carefully 
controlled, such architectural optimizations can cause memory accesses to be executed in 
an order different from what the programmer expects. The set of allowable ... 
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Understanding distributed applications is a tedious and difficult task. Visualizations based on 
process-time diagrams are often used to obtain a better understanding of the execution of 
the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not provide 
the user with the desired overview of the application. In our experience, such tools display 
repeated occurrences of non-trivial commun ... 
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